WHAT IS CLAIMED IS: 

1 . A speech synthesis system, which synthesizes speech using 
time series data of formant parameters (including a formant 
frequency and a formant bandwidth) estimated based on a speech 
production model, the speech synthesis system comprising 
determining the correspondence of formant parameters between 
adjacent frames using dynamic programming. 

2. The speech synthesis system of Claim 1, wherein in 
determining the correspondence of the formant parameters, a 
connection cost do{F(n) , F(n+1) ) and a disconnection cost dd{F(k) ) 
are obtained using the equations: 

{F(n),F(n + 1)) = cc\f^ («) - Ff {n +1)| + ^\F. («) - F,{n +1)| 

d, {F{k)) = «|f, (k) - F, ik)\ + p\F, (k) - e\ 
= /3\FXk)-e\ 

where a and pare predetermined weight coefficients, Ff(n) is a 
formant frequency in the n*^ frame, that Fi(n) is a formant intensity 
in the n^"^ frame and e is a predetermined value, and the resultant 
dc(F(n) , F(n+1) ) anddd(F{k) ) are used as costs for grid point shifting 
in dynamic programming. 

3. The speech synthesis system of Claim 2, wherein for two 
adjacent frames in which exists a formant which has no counterpart 
to be connected, 

a formant having the same frequency as that of the disconnected 
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formant in one of the frames and an intensity of 0 is located in 
the other frame and 

the two adjacent frames are connected by interpolation of 



smooth function. 

4 . The speech synthesis system of Claim 2 , wherein the formant 
intensity Fi(n) is calculated using 



where Fb(n) is a formant bandwidth in the n''*' frame and is a sampling 
frequency . 



tract transfer function including a plurality of formants is 
implemented by a cascade connection of a plurality of filters and 

wherein when a formant which has no counterpart to be connected 
exists in the adjacent frames and thus the connection of the filters 
needs to be changed, 

a coefficient and an internally stored data of the filter 
in question are copied into another filter and 

the first filter is then over written with a coefficient and 
an internally stored data of still another filter or initialized 
to predetermined values . 



frequencies and intensities of both the formants according to a 




5. The speech synthesis system of Claim 3, wherein a vocal 
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6. The speech synthesis system of Claim A, wherein a vocal 
tract transfer function including a plurality of formants is 
implemented by a cascade connection of a plurality of filters and 

wherein when a formant which has no counterpart to be connected 
exists in the adjacent frames and thus the connection of the filters 
needs to be changed, 

a coefficient and an internally stored data of the filter 
in question are copied into another filter and 

the first filter is then over written with a coefficient and 
an internally stored data of still another filter or initialized 
to predetermined values. 

7 . A speech analys is method , in which a sound source parameter 
and a vocal tract parameter of a speech s ignal waveform are estimated 
by using a glottal source model including an RK voicing source 
model, the speech analysis method comprising the steps of: 

extracting an estimated voicing source waveform using a 
filter which is constituted by the inverse characteristic of an 
estimated vocal tract transfer function; 

estimating a peak position corresponding to a GCI {glottal 
closure instance) of the estimated voicing source waveform with 
higher accuracy at closer time intervals than that with the sampling 
period by applying a quadratic function; 

synthesizing the GCI with a sampling position in the vicinity 
of the estimated peak position and thereby generating a voicing 
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source model waveform; and 

time-shifting the generated voicing source model waveform 
with higher accuracy at closer time intervals than that with the 
sampling period by means of all pass filters and thereby matching 
the GCI with the estimated peak position. 

8. A speech analysis method, in which a voicing source 
parameter and a vocal tract parameter of a speech signal waveform 
are estimated by using a glottal voicing source model such as an 
RK model or a model defined as a modified model thereof, the speech 
analysis method comprising the steps of: 

extracting an estimated voicing source waveform using 
filters which are constituted by the inverse characteristic of 
an estimated vocal tract transfer function; and 

assuming the first harmonic level as Hi and the second 
harmonic level as H2 in DFT (discrete Fourier transformation) of 
the estimated voicing source waveform and estimating an OQ (open 
quotient) from a value for HD defined as HD=H2-H1. 

9. The speech analysis method of Claim 8, wherein for 
estimating the OQ, the relation: 

00=3 . 65HD-0 . 213HD^+0 , 0224ifD^+50 . 7 
is used. 
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